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1.  Introduction 


The  feedback  control  system  considered  is  shown  in  Fig.  1.1.  This  system  is 
referred  to  as  a  3D0F  feedback  system  because  of  the  three  real  rational  transfer 
matrices  Cu(s ),  Cw(s),  and  Cz(s)  which  can  be  chosen  in  the  design  of  the  controller. 
When  no  feedforward  transducer  is  available,  Cz(s)  —  0  and  the  feedback  system 
reduces  to  a  2DOF  system.  Optimal  and  suboptimal  designs  for  the  system  shown  in 
Fig.  1.1  are  treated  in  [1]  and  form  the  basis  for  the  work  presented  here.  In  fact,  the 
notation  used  here  is  consistent  with  that  used  in  [1]  for  easy  reference.  The  transfer 
matrices  given  in  the  blocks  of  Fig.  1.1  are  real  and  rational.  The  matrices  Ft(s)  and 
T0(s)  are  n  x  n,  P(s)  is  nxm,  Lt(s )  is  l  x  r,  and  Po(s)  is  n  x  r.  The  dimensions  of 
all  other  vectors  and  matrices  are  taken  to  be  compatible  with  these. 


Figure  1.1:  The  3DOF  system. 

The  Wiener-Hopf  design  philosophy  for  two  and  three-degree-of-freedom  (2D0F 
and  3D0F)  multivariable  control  systems  is  described  in  [1-3].  In  these  works,  a 
quadratic  cost  functional  is  postulated  to  measure  system  performance.  The  cost 
functional  takes  into  account  such  issues  as  tracking  error,  saturation,  and  sensitivity 
to  small  parameter  variations.  The  main  contribution  of  this  earlier  work  is  the 
identification  of  the  subset  of  all  stabilizing  controllers  for  which  the  cost  functional 
is  finite.  This  subset  is  parameterized  for  two- degree- of-freedom  (2D0F)  systems  in 
terms  of  two  real  rational  matrix  parameters  each  of  which  is  strictly  proper  and 
analytic  in  the  closed  right  hand  side  of  the  complex  plane.  An  identical  result  holds 


3 


for  three-degree-of-freedom  (3D0F)  systems  except  that  the  parameterization  is  in 
terms  of  three  rather  than  two  such  matrices.  Since  the  2D0F  system  can  be  treated 
as  a  special  case  of  the  3D0F  system  [1],  the  ideas  here  are  first  presented  and 
developed  for  the  3D0F  case.  The  results  for  the  2D0F  case  are  then  easily  derived. 

The  cost  functionals  are  minimized  in  both  the  2D0F  and  3D0F  cases  when  the 
matrix  parameters  are  chosen  to  be  null  matrices.  When  non-null  choices  are  made  for 
any  of  the  matrix  parameters,  an  increase  in  the  cost  functional  results  which  is  related 
to  the  matrix  parameters  through  simple  formulas.  Similar  results  are  available  in  [4], 
[5]  for  the  so-called  generalized  plant  configuration.  In  this  configuration,  all  system 
components  other  than  the  controller  are  represented  by  the  generalized  plant.  This 
point  of  view  is  especially  appealing  to  those  who  wish  to  concentrate  on  solving 
the  mathematical  problems  associated  with  controller  design.  Their  results  are  then 
applicable  to  all  possible  feedback  system  configurations.  On  the  other  hand,  it  is 
also  useful  for  engineers  to  have  design  formulas  available  for  a  specific  configuration 
whose  structure  has  wide  application.  Such  a  structure  is  the  one  employed  in  the 
3D0F  system.  Indeed,  it  is  hard  to  imagine  any  control  system  applications  which  are 
not  included  in  the  3D0F  case,  although  a  4D0F  system  involving  the  generation 
of  a  signal  to  be  used  for  diagnostic  purposes  is  described  in  [6].  For  the  3D0F 
structure,  the  relationship  of  the  matrix  design  parameters  to  important  engineering 
issues  can  be  exploited.  In  particular,  those  matrix  parameters  which  are  connected 
to  system  robustness  can  be  used  to  trade-off  optimal  nominal  system  performance 
against  improved  robustness  with  respect  to  plant  uncertainty.  A  straightforward 
methodology  for  achieving  this  trade-off  is  described  in  this  paper. 

The  direct  nature  of  the  methodology  employed  here  together  with  the  careful 
attention  paid  to  the  increase  in  the  value  of  the  original  quadratic  cost  functional 
contrasts  with  the  design  point  of  view  employed  in  the  loop  transfer  recovery  (LTR) 
methodology  [7-14].  Specifically,  the  cost  functional  is  viewed  here  as  a  truly  in¬ 
dicative  measure  of  nominal  system  performance  and  not  one  whose  weights  can  be 
manipulated  to  achieve  desired  engineering  goals.  Instead,  the  matrix  parameters  are 
viewed  as  the  objects  to  be  manipulated  and  the  cost  increments  associated  with  any 
such  manipulation  is  kept  to  the  forefront.  In  addition,  a  one-step  process  instead 
of  a  two-step  process  is  used  to  find  the  unique  choice  for  the  matrix  parameters. 
This  unique  choice  is  the  solution  of  a  constrained  optimization  problem.  That  is, 
one  in  which  a  quadratic  or  H2  measure  of  the  stability  margin  is  optimized  for  a 
prescribed  increment  in  the  cost  functional.  Moreover,  no  special  considerations  are 
needed  when  the  plant  transfer  matrix  is  nonminimum  phase  and/or  sensor  noise  as 
well  as  disturbance  inputs  are  present.  Also  the  design  procedure  begins  with  coprime 
polynomial  matrix  fraction  descriptions  for  the  plant  and  sensors  and  can  be  carried 
out  without  state  variable  models  for  these  system  components. 

Justification  for  the  point  of  view  adopted  here  is  based  on  the  following  obser- 
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vations.  Foremost  in  the  design  of  advanced  control  systems  is  the  need  to  satisfy 
performance  specifications.  In  fact,  one  often  sacrifices  stability  margin  or  accepts 
conditionally  stable  systems  in  order  to  realize  superior  performance.  So  one  should 
begin  with  an  optimal  #2  solution.  When  this  solution  exceeds  specifications,  the  cost 
functional  can  be  increased  and  the  increment  traded  for  an  improvement  in  stability 
margin.  That  is,  one  should  maximize  stability  margin  subject  to  a  constraint  on 
the  H2  norm  cost  increment.  When  the  stability  margin  obtained  in  this  way  is  not 
adequate,  then  one  must  use  a  different  controller  for  different  operating  profiles  of 
the  plant:  typically,  one  should  not  give  up  performance  just  for  the  sake  of  operating 
with  a  single  fixed  controller  when  high  performance  is  required. 

In  a  recent  paper  [15],  stability  margin  is  addressed  in  a  quadratic  cost  setting 
by  including  an  additional  term  that  accounts  for  stability  margin  in  the  minimized 
cost  functional.  This  approach,  however,  was  not  fully  explored  in  [15]  and  does  not 
provide  the  perspective  gained  here.  Here,  instead,  suboptimal  performance  is  linked 
to  stability  margin  through  some  non- null  choices  of  the  matrix  parameters.  In  this 
way,  the  trade-off  being  made  becomes  transparent  and  important  design  formulas 
are  obtained. 

The  use  of  a  quadratic  cost  functional  as  an  indicator  of  control  system  perfor¬ 
mance  for  systems  subjected  to  both  stochastic  and  shape-deterministic  inputs  has 
been  a  long  accepted  practice  in  the  control  field  and  needs  no  justification  here 
[16,17].  Stability  margin,  on  the  other  hand,  is  more  correctly  measured  by  an  H ^ 
norm  [18].  So  the  real  problem  involves  the  application  of  an  f/2  norm  on  an  in¬ 
dicator  of  system  performance  and  the  application  of  an  Hoo  norm  on  an  indicator 
of  stability  margin,  with  the  indicators  in  each  case  being  different  functions  of  the 
same  matrix  parameter.  That  is,  the  real  problem  to  be  solved  is  a  mixed  H2/-H00 
problem  [19-29].  The  approaches  in  the  literature  for  the  resolution  of  this  type  of 
problem  have  typically  focused  on  minimizing,  subject  to  an  H <*,  norm  constraint,  a 
H2  norm.  Except  for  [25],  analytical  solutions  are  not  available.  In  [25],  however, 
the  generalized  plant  configuration  is  considered  with  state  feedback.  This  case  is 
impractical  since  not  only  is  the  state  of  the  original  physical  plant  required,  but  also 
the  states  of  all  the  sensors.  Moreover,  it  is  argued  below  that  when  the  H <*,  norm  is 
used  to  measure  stability  margin,  it  is  the  reverse  problem  that  is  more  appropriate. 
That  is,  one  should  instead  minimize  the  norm  subject  to  a  constraint  on  the  ff2 
norm.  So  it  is  believed  that  the  preliminary  efforts  in  [30]  to  trade-off  excess  stability 
margin  in  the  form  identified  in  [18]  in  order  to  minimize  a  quadratic  performance 
measure  is  not  headed  in  the  right  direction. 

Unfortunately,  minimizing  the  H0 0  norm  subject  to  a  constraint  on  the  H2  norm  is 
also  a  difficult  problem  for  which  analytical  solutions  are  not  expected.  In  this  paper, 
the  difficulties  are  circumvented  by  choosing  a  meaningful,  although  nonrigorous, 
quadratic  measure  rather  than  an  measure  of  stability  margin.  This  is  in  keeping 
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with  the  use  of  a  quadratic  measure  for  saturation  which  is  also  approximate.  In 
addition,  in  contrast  with  [18]  where  coprime  rational  matrix  fraction  descriptions 
for  the  plant  are  used,  here  polynomial  matrix  fractions  are  used  and  uncertainty 
is  modelled  as  perturbations  in  the  coefficient  matrices  of  these  coprime  polynomial 
matrices.  It  is  believed  that  this  is  a  more  natural  way  to  model  structural  uncertainty 
in  practice. 

By  working  with  an  H2  norm  for  both  measures  of  performance  and  stability 
margin,  a  key  result  in  [31]  can  be  employed  and  one  obtains  a  frequency-domain 
analytical  solution  for  the  design  problem  to  within  the  choice  of  a  single  scalar  La- 
grangian  multiplier.  It  is  anticipated  that  this  approach  will  often  lead  to  satisfactory 
designs  or  at  least  good  initial  designs  from  which  refinements  can  be  explored.  In¬ 
deed,  it  is  possible  to  view  the  design  problem  treated  here  as  a  trade-off  among  Pareto 
optimal  solutions  for  a  multicriterion  optimization  problem  [32].  Certainly,  the  ro¬ 
bustness  analysis  tools  described  in  [33]  would  also  play  a  major  role  in  the  evaluation 
and  modification  of  the  control  systems  synthesized  using  the  methodology  presented 
here. 

There  is  a  danger  in  using  an  H 2  norm  instead  of  an  II norm  for  stability 
margin.  In  [34],  for  a  class  of  scalar  systems,  it  is  established  that  the  H ^  norm  as  an 
indicator  of  system  performance  can  be  substantially  larger  than  its  minimum  value 
when  the  free  matrix  parameter  is  chosen  to  be  the  one  which  minimizes  the  H2  norm 
of  this  same  indicator.  Since  the  minimum  norm  solutions  are  typically  overly 
conservative  [35,36]  when  uncertainties  are  structured,  and  since  the  properties  of  the 
function  whose  Hqo  norm  was  investigated  in  [34]  differ  somewhat  from  the  properties 
of  the  function  used  here  as  an  indicator  of  stability  margin,  it  is  expected  that  the 
potential  danger  of  working  with  an  H2  norm  is  mitigated. 

The  H2  norm  trade-off  problem  of  interest  here  can  also  be  approached  numer¬ 
ically  using  convex  programming  along  the  lines  taken  in  [29,32,37,38]  or  indirectly 
through  linear  matrix  inequalities  (LMI)  as  described  in  [39,40].  The  former  involves 
approximating  the  infinite-dimensional  set  for  the  real,  rational,  strictly-proper  ma¬ 
trix  design  parameter  with  a  sequence  of  larger  and  larger  finite-dimensional  subsets. 
The  latter  uses  convex  programming  to  solve  the  LMI  associated  with  an  optimal  fixed 
order  controller.  So  to  approximate  the  infinite-dimensional  problem,  a  sequence  of 
controllers  of  increasing  order  would  have  to  be  studied.  Clearly,  the  analytical  result 
given  in  the  sequel  serves  as  an  important  complement  to  these  numerical  optimiza¬ 
tion  approaches.  This  analytical  result  also  provides  a  guide  for  choosing  a  good 
starting  point  when  these  numerical  approaches  are  used  to  search  for  the  solution  to 
the  underlying  mixed  H2  /ifoo  problem,  should  this  step  be  necessary. 

All  results  in  this  paper  are  developed  by  working  exclusively  in  the  complex  s- 
plane  and  all  matrix  functions  of  s  =  <7  +  ju>  are  assumed  to  be  real  and  rational.  The 
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real  part  of  s  Is  denoted  by  Re  s.  For  any  matrix  G(s),  the  notations  G\  G,  G*,  det  G, 
and  TrG  are  used  for  the  transpose,  conjugate,  conjugate  transpose,  determinant,  and 
trace  of  G(.s),  respectively.  The  matrix  G*(s)  is  the  conjugate  transpose  of  G(—s)  or 
G*(s)  —  G*(—s)  which,  for  real  rational  matrices,  reduces  to  G*(s)  =  G  (— s ).  A  real 
matrix  G(s)  is  called  para-Hermitian  when  G  —  G».  A  diagonal  matrix  G  with  gi 
in  the  i-row,  i-column,  i=l  — *■  n,  is  denoted  by  G=diag{<h,  <72...,<7„}.  The  Kronecker 
product  of  two  matrices  is  denoted  as  G®R  and  is  the  matrix  whose  ij-block  is  given 
by  gsjR.  The  vector  g=vec  G=[g[  g'2  ■•■g'n]'  is  formed  by  stacking  all  the  columns  of 
the  matrix  G=[<7i<72  -■■gn]-  As  is  evident  from  the  above,  function  arguments  are  often 
omitted  for  brevity  when  no  confusion  is  possible.  The  notation  G(s)  <  0(sl/)  means 
that  no  entry  in  G(s)  grows  faster  than  sv  as  s  —*■  oo.  In  the  partial  fraction  expansion 
of  G(s),  the  contributions  made  by  all  its  finite  poles  in  Re  s  <  0,  Re  s>0,  and  by 
its  poles  at  s  =  oo  are  denoted  by  {G}+,  {G}_,  and  {(j}oo,  respectively.  Clearly, 
{G}+  is  analytic  in  Re  s>0,  {G}_  in  Re  s  <  0  and  both  are  <  0(s-1).  The  identity 
matrix  is  denoted  by  I.  A  positive  definite  (positive  semidefinite)  matrix  G  is  denoted 
by  G  >  0  (G  >  0).  A  matrix  G  is  said  to  be  good  when  it  is  analytic  on  J,  the 
finite  part  of  the  s  =  jcj-axis.  The  normal  rank  of  a  matrix  G(s),  denoted  '/(G),  is 
the  highest  order  of  all  non-identically-zero  minors  of  G(s).  When  needed  for  clarity 
vertical  and/or  horizontal  lines  are  used  to  identify  the  partitions  of  a  matrix.  The 
expectation  operator  is  denoted  by  £{}. 


2.  Summary  of  Available  Results 


The  Wiener-Hopf  design  of  optimal  3DOF  systems  and  the  parameterization  of 
all  suboptimal  designs  which  is  presented  in  [1]  is  summarized  in  this  section.  In  this 
regard,  it  is  convenient  to  introduce  the  three  closed-loop  transfer  matrices 

Ru  =  (I  +  CwFtP)~lCu, 

Rw  =  (I  +  CwFtP)-1Cw, 

and 

Rz  =  (I  +  CwFtP)~1Cz.  (2.3) 

Also  essential  in  this  work  are  several  coprime  polynomial  matrix  fraction  descrip¬ 
tions.  Specifically,  one  can  always  write  [41] 

FtP  =  B1Ai1  =  A~lB ,  (2.4) 


(2.1) 

(2.2) 
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where  By ,  Ay  is  a  right  and  A,  B  is  a  left  coprime  pair  of  polynomial  matrices  for 
which  there  exist  real  polynomial  matrices  X ,  Y,  Xx,  and  Y\  so  that 


A1Y1  =  YA, 

(2.5a) 

AX  +  BY  =  /, 

(2.5b) 

XyAy  +  YyBy  =  /, 

(2.5c) 

detXdetX \  ^  0. 

(2.5d) 

It  is  a  consequence  of  Theorem  1  in  [1]  that  any  Cw  associated  with  a  stabilizing 
controller  is  of  the  form 


Cw  =  (Xy  -  I<xB)-\Yy  +  KyA)  (2.6) 

where  Ky  is  a  real  rational  matrix  analytic  in  Re  s>0.  It  then  easily  follows  from 
(2.2)  that 

RW  =  A1(Y1  +  I<1A)  (2.7) 


and 


I  -  RwFtP  =  (/  +  CvFtP)-1  =  Ay{Xy  -  KyB ).  (2.8) 

The  cost  functional  E  which  takes  tracking  error,  saturation,  and  sensitivity  into 
account  was  introduced  in  [1,2]  and  is  as  follows: 

1  rj°°  h  C3°° 

E  =  —  Tr(Ge(s))ds  +  —  /  Tr(Q(s)Gr(s))ds 

Z7TJ  J—joo  Z7VJ  J  —joo 

+  7T-  r  Tr(SGs{s)S*)ds  +  r  Tr{kQRwFtG sFuRw*)ds  (2.9) 

Zl VJ  J —joo  Z/7T J  J—joo 

where  Ge ,  Gr,  and  Gs  are  the  spectral  densities  of  the  error  e(s)  =  To(s)u(s)  —  y(s), 
the  input  r(s),  and  the  plant  uncertainty  (see  equation  (2.18)),  respectively.  The 
constants  k  and  /i  are  nonnegative  and  Q(s)  is  a  para-Hermitian  nonnegative  definite 
weighting  matrix.  The  matrix  S(s)  =  I  —  PRwFt  is  a  closed-loop  sensitivity  matrix. 

The  expression  for  E  in  (2.9)  decomposes  into 
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E  —  Eu  +  Ewz 


(2.10) 


in  which 

I  rd°° 

Eu  —  - — :  /  [Tr{kQRuGuRu*  +  (P*P  +  kQ)RuGnRu*} 

Zir J  J—joo 

+  Tr{(T0-PRu)Gu(T0-PRu)*})ds  (2.11) 

includes  all  terms  involving  Ru  and 

1  r3°° 

Ewz  =  - — 7  /  Tr[kQ(RwFtPo  +  RzLt)Gd{RwEtPo  +  RzLt)* 

2,7TJ  J—joo 

+  (P*P  +  kQ)(RwGmRw *  +  RzGiRz *)  +  kQ RwFtfJ-G sFt*Rw*]  ds 

1  ri°° 

+  7^  /  Tr[(P0  -  PRwFtP0  -  PRzLt)Gd(P0  -  PRwFtP0  -  PRzLt ). 

2i7TJ  J—joo 

+  (I-  PRwFt)fiGs(I  -  PRwFt ),]  ds  (2.12) 

all  those  involving  Rw  and  Rz.  In  (2.11)  and  (2.12),  Gd,  Gi,  Gm ,  Gu ,  and  Gn  are 
specified  spectral  densities  for  the  signals  d(s),  l(s),  m(s),  u(s ),  and  n(s)  respectively, 
which  also  account  for  modelling  of  any  shape-deterministic  components  in  the  signals 

[4]- 

Clearly,  Eu  is  a  nonnegative  functional  of  Ru  and  Ewz  is  a  nonnegative  functional 
of  Rw  and  Rz.  So  the  minimization  of  E  is  equivalent  to  two  separate  minimization 
problems:  the  minimization  of  Eu  with  respect  to  Ru  and  the  minimization  of  Ewz 
with  respect  to  the  pair  R w,  Rz.  Also  of  interest  is  the  parameterization  of  the  set  of 
all  Ru,  Rw,  and  Rz  associated  with  an  asymptotically  stable  3DOF  system  possessing 
a  finite  Eu  and  a  finite  Ewz.  In  this  regard,  the  two  definitions  which  follow  are 
introduced. 

Definition  1:  The  real  rational  matrices  Ru,  Rv, ,  and  Rz  are  said  to  be  acceptable 
for  the  given  Ft,  P  if  there  exists  a  controller  which  realizes  them  as  the  designated 
closed-system  transfer  matrices  of  an  internally  asymptotically  stable  configuration 
of  the  generic  type  shown  in  Fig.  1.1. 

Definition  2:  The  acceptable  matrices  Ru ,  Rw ,  and  Rz  are  called  admissible  if  they 
yield  finite  cost  E.  An  admissible  controller  is  one  which  realizes  an  admissible  triple 
Ru ,  Rw ,  and  Rz. 

The  admissible  Ru  for  which  Eu  is  minimized  is  designated  by  Ru.  Similarly,  the 
admissible  pair  Rw ,  Rz  for  which  Ewz  is  minimized  is  designated  by  Rw,  Rz.  Specific 
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formulas  for  Ru ,  Rw,  and  Rz  are  given  in  [1].  The  associated  minimum  values  for  Eu 
and  Ewz  are  accordingly  indicated  by  Eu  and  Ewz,  respectively. 

It  is  evident  from  (2.2)  that  Cw  is  determined  from  Rw  by  the  formula 


Cw  =  {I-RwFtP)~1RVJ.  (2.13) 

So  there  is  a  one-to-one  correspondence  between  Cw  and  Rw.  Clearly,  only  Cw  impacts 
on  the  stability  margin  of  the  feedback  loop.  Hence,  no  further  consideration  is 
given  here  to  Eu  and  Ru.  Interested  readers  are  referred  to  [42,43]  where  suboptimal 
choices  of  Ru  are  investigated  in  connection  with  the  design  of  decoupled  multivariable 
systems. 

Before  one  can  describe  in  detail  the  results  obtained  in  [1]  that  are  needed  here, 
two  Wiener-Hopf  spectral  factorizations  have  to  be  defined.  In  the  first,  the  matrix 
A  is  a  Wiener-Hopf  spectral  solution  (i.e.,  A  and  A-1  are  analytic  in  Re  s>0)  of  the 
equation 


Ai*(P*P  4-  kQ)A\  —  A*A. 


(2.14) 


The  second  Wiener-Hopf  spectral  factorization  involves  the  matrices 


A  = 


A 

0 


0 

I 


and 


$ 


FtGdsFt*  +  Gv 
LtGdP0*Fu 


FtPoGdLu 
LtGdLt*  +  Gi 


where  the  matrix  Gds  is  given  by 


Gds  —  PoGdPo*  +  nGs 


and 


Gs  =  £{{6P)(8P)*}  +  £{{8P0)(8Po)*} 


(2.15) 


(2.16) 


(2.17) 


(2.18) 


is  a  probabilistic  model  for  first-order  perturbations  8P  and  8Pq  in  the  plant  transfer 
matrices  P  and  Po,  respectively.  The  matrix  Gs  appears  in  the  terms  included  in 
the  cost  functional  E  which  take  into  account  the  sensitivity  of  the  tracking  error  e 
and  plant  input  r  to  plant  parameter  changes.  In  the  second  Wiener-Hopf  spectral 
factorization  needed,  the  matrix  fl  is  a  Wiener-Hopf  spectral  solution  of  the  equation 
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=  nn*.  (2.19) 

A  set  of  seven  underlying  assumptions  cited  below  are  introduced  in  [1]  for  which 
the  following  holds. 

(i)  The  set  of  all  acceptable  Rw  and  Rz  that  yield  a  finite  Ewz  is  generated  by  the 
formula 

[RW\RZ]  =  [RW\RZ]  +  AiA-'iZ^Zjn-'A  (2.20) 

where  Zw  and  Zz  can  be  any  real  rational  matrices  analytic  in  Re  s>0  and  satisfying 
<  0{s~%  Zz  <  0(s~l). 

(ii)  The  cost  functional  Ewz  is  given  by 

Ew  z  =  Ewz  +  A  Ewz  (2.21) 


where 


1  p°° 

A  Ewz  =  —  /  Tr[ZwZw ,  +  ZzZz*]ds.  (2.22) 

Z7TJ  J  —joo 

Clearly,  the  choice  Zw  =  0,  Zz  =  0  leads  to  optimum  performance.  When  optimum 
performance  exceeds  specifications,  there  exists  a  number  N  for  which  all  allowed 

zw  =  [Zw\Zz]  (2.23) 

satisfying 

I  rioo 

A Ewz  —  — — 7  /  Tr[ZwzZwz*\ds  <  N  (2.24) 

27TJ  J —joo 

can  be  considered  for  design  trade  offs.  Here,  in  particular,  that  allowed  Zwz  satisfying 
equation  (2.24)  is  sought  for  which  a  stability  margin  measure  is  minimized.  The 
purpose  of  this  paper  is  to  describe  a  mechanism  for  achieving  this  goal. 

The  set  of  seven  underlying  assumptions  mentioned  above  also  insure  the  existence 
of  an  admissible  Ru  so  that  a  finite  E  is  guaranteed  to  exist.  This  assures  that 
the  problem  addressed  here  is  a  meaningful  one.  These  seven  assumptions  include 
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conditions  which  also  involve  the  specified  spectral  densities  Gu  and  Gn  for  the  signals 
u(s)  and  n(s),  respectively.  For  easy  reference,  these  seven  assumptions  follow. 

Assumption  1:  All  system  blocks  are  free  of  hidden  poles  in  Re  s>0.  The  transfer 
matrix  Lt( s)  and  any  portion  of  Po(s)  arising  from  a  block  outside  the  feedback  loop 
are  analytic  in  Re  s>0.  For  any  finite  pole  so  of  P  or  Ft  in  Re  s>0,  the  McMillan 
degree  of  s0  as  a  pole  of  FtP  is  equal  to  the  sum  of  its  McMillan  degrees  as  a  pole  of 
P  and  Ft. 


Assumption  2:  The  matrix  To(s)  is  proper  and  the  spectral  density  Gu(s )  vanishes 
at  least  as  fast  as  1/s2,  as  s  — *•  oo,  i.e.,  Gu(s)  <  0(s~2). 

Assumption  3:  T0  and  Ft  are  analytic  and  nonsingular  on  the  finite  part  of  the 
s  =  ju  axis.  In  addition,  To  is  analytic  in  Re  s>0  and  (ToFt  —  I)P  is  analytic  on  the 
finite  part  of  the  s  =  ju  axis. 

Assumption  4:  AGUA *,  Q ,  Gn,  and  ( Gu+  Gn )-1  are  good  (i.e.,  analytic  on  the 
finite  part  of  the  s  =  ju  axis). 

Assumption  5:  AU(P*P  +  kQ)A\  is  nonsingular  on  the  finite  part  of  the  s  =  ju 
axis. 

Assumption  6:  The  matrix  A$A*  is  analytic  and  nonsingular  on  the  finite  part  of 
the  s  =  ju  axis. 

Assumption  7:  The  plant-uncertainty  spectral  density  Gs(s)  and  Po(s)Gd(s)Po*(s) 
are  both  <  0(s~2),  and  Gm(s)  is  analytic  on  the  finite  part  of  the  s  =  ju  axis.  In 
addition  FtP  <  (9(s-J/°),  <J>-1(s)  <  0(s~2ui),  and  (P*P  +  kQ)~l  <  0{s~2v2),  where 
+  ^2  >  0. 

An  additional  assumption  which  plays  an  essential  role  in  the  sequel  is  now  intro¬ 
duced. 

Assumption  8:  v\  +  i/2  +  1  >  0  . 

It  is  now  possible  to  state  an  important  lemma  which  is  an  immediate  consequence 
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of  Lemma  2  in  [1]. 


Lemma  1:  When  Assumptions  7  and  8  hold,  then  f l-1  A  <  0(s~Vl),  AiA-1  < 
0(s~U2),  and  all  admissible  Rw ,  Rz  satisfy  [Ry,  |  Rz]  <  0(s~(i'1+"2+1))  which  implies 
that  Rw,  Rz  are  proper. 

It  is  also  convenient  here  to  introduce  the  following  definition. 

Definition  3:  Any  pair  of  transfer  matrices  Ft,  P  that  satisfy  Assumption  1  are 
called  admissible. 


3.  Stability  Margin  Considerations  and  Problem  Formulation 


The  transfer  matrices  shown  in  Fig.  1.1  are  nominal  transfer  matrices.  Except  for 
the  plant,  it  is  assumed  that  the  nominal  and  actual  transfer  matrices  of  the  system 
components  are  the  same.  The  plant,  however,  has  an  actual  transfer  matrix  which 
differs  from  the  nominal  one.  In  this  paper,  there  are  two  differences  between  the 
nominal  plant  transfer  matrix  and  the  actual  plant  transfer  matrix  that  are  of  interest. 
The  difference  8P  in  (2.18)  represents  the  small  uncertainty  of  the  plant  when  it  is 
operating  normally  near  nominal  conditions.  The  difference  8P  is  introduced  in  the 
sequel  to  characterize  large  perturbations  associated  with  abnormal  operation  that 
could  lead  to  instability.  The  overall  objective  here  is  to  design  the  controller  so  that 
with  normal  system  performance  a  specification  on  the  value  of  the  cost  functional 
Ewz  is  met,  while  providing  the  maximum  tolerance  against  instability  arising  from 
the  perturbations  8P.  In  this  section,  attention  is  focused  on  the  question  of  stability 
and  the  impact  of  8P  on  this  issue.  In  this  regard,  the  right  coprime  polynomial 
matrix  fraction  description 

Ft(P  +  8P)  =  (£*  +  8Bi)(Ai  +  8AJ-1  (3.1) 

is  introduced  to  define  the  polynomial  perturbations  8A\  and  8B\. 

When  Assumption  1  holds  and  the  pair  Ft ,  (P+8P)  is  admissible,  it  is  not  difficult 
to  show  [1]  that  the  system  is  internally  asymptotically  stable  for  perturbations  8A\ 
and  8B\  if,  and  only  if, 

<t>8  =  det[I  +  (Ax  -  Ax5)6Ax  +  (Y1  +  K1A)8B1]  (3.2) 
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is  free  of  zeros  in  Re  s>0.  Using  (2.7)  and  (2.8),  (3.2)  may  immediately  be  written  as 


(j>s  =  det{I  +  A1 1 


S  Ru 


8  Ay 
8By 


(3.3) 


where 

S  =  I-  RwFtP  (3.4) 

is  a  loop  sensitivity  matrix  on  account  of  (2.8). 

Since  all  coprime  polynomial  matrix  pairs  associated  with  matrix  fraction  descrip¬ 
tions  of  the  same  rational  matrix  differ  from  one  another  only  by  a  unimodular  matrix 
multiplier  [44,45],  the  results  cited  above  are  independent  of  the  particular  choice  of 
matrix  fraction  descriptions  used  (e.g.,  see  the  final  paragraph  in  the  Appendix  of 
[2]).  However,  it  is  essential  here  that  Ay  satisfies  the  following  assumption. 

Assumption  9:  The  polynomial  matrix  Ay  is  column  reduced  and  Ay1  is  strictly 
proper. 


Assumption  9  is  not  overly  restrictive.  In  fact,  given  any  coprime  matrix  fraction 
description  BA -1,  one  can  find  a  unimodular  matrix  U  so  that 

Air1  =  {ByU){AyU)~l  =  ByA- 1  (3.5) 

where  Ay  is  column  reduced  [44,45].  So  there  is  no  loss  in  generality  in  assuming  that 
Ay  is  column  reduced.  It  is  also  true  in  almost  all  cases  of  practical  interest  that  Ay 1 
is  strictly  proper  when  Ay  is  column  reduced.  Indeed,  one  has  the  following  lemma. 

Lemma  2:  When  FtP  =  By  Ay1  is  strictly  proper,  when  the  pair  By,  Ay  is  right 
coprime  with  Ay  column  reduced,  and  when  there  exists  no  constant  vector  a  ^  0 
for  which  FtPa  =  0,  then  Ay1  is  strictly  proper. 

Proof:  First,  it  is  established  that  the  column  degrees  of  Ay  must  all  be  positive. 
When  the  contrary  is  true,  there  is  at  least  one  column  of  Ay  which  has  zero  degree. 
This  column  then  contains  only  constants.  These  constants  cannot  all  be  zero  because 
det  Ay  ^  0.  This  constant  nonzero  vector  is  designated  by  a.  When  ( By)j  denotes 
the  j-column  of  By  and  a  is  the  j-column  of  Ay,  it  then  follows  that 


(By)j  =  FtPa. 


(3.6) 
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Since  FtP  is  strictly  proper, 

lim  (Bi)j  =  0.  (3.7) 

s— ►  oov  ,J 

But  ( B\)j  is  polynomial  and  the  only  way  (3.7)  can  be  true  is  if  (Bi)j  =  0.  It  then 
follows  from  (3.6)  that 


FtPa  =  0,  a.  ^  0  (3-8) 

which  contradicts  the  hypothesis.  It  is  now  a  simple  matter  to  prove  A f1  is  strictly 
proper.  Since  A\  is  column  reduced,  one  can  write 

A1  =  (A1)hcS  +  L  (3.9) 

where  ( A\)hc  is  a  nonsingular  constant  matrix, 

S  —  diag{sScl  ,sSc2 ,  ...,s5cm}  (3.10) 

with  6cj  being  the  column  degree  of  the  j-column  of  Ax,  and  the  column  degrees  of  the 
polynomial  matrix  L  are  less  than  the  corresponding  column  degrees  of  A\.  Clearly, 

lim  A T1  =  lim  S-'KA!)^  +  LS -1]-1  =  lim  S~\A1)^  =  0  (3.11) 

and  so  A^1  is  strictly  proper.  This  completes  the  proof. 


It  should  be  noted  that  lemmas  similar  to  Lemma  2  have  appeared  in  the  literature 
(e.g.,  see  Theorem  G-10  in  [44]  and  Lemma  6.3-11  in  [45]),  but  the  possibility  of  (3.8) 
was  overlooked.  A  simple  example  which  exposes  the  difficulty  is  easily  generated 
from 


cq  ^ 

1 - 

= 

'  0 

1 

1 

s 

L  x  J 

_  1 

2s  _ 

(3.12) 


In  this  case,  FtP  =  [  “•  7  I  is  strictly  proper;  however,  A^1  is  only  proper  even 

though  A\  is  column  reducea.  This  example  is  ruled  out  as  a  counterexample  in 
Lemma  2  because  it  satisfies  FtPa  =  0  for  a  =  [1  1]  . 
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It  should  be  emphasized  that  Lemma  2  provides  sufficient  conditions  for  Aj"1  to 
be  strictly  proper.  In  fact,  one  frequently  finds  that  A f1  is  strictly  proper  when  FtP 
is  only  proper  or  even  improper. 

The  modelling  of  the  perturbations  8A\,  8B\  is  considered  next.  Since  A\  is 
column  reduced,  one  can  always  write 

k 

A\  -  ^AuSi,  detAik  -  det(A-i)hc  ±  0  (3.13) 

i= 0 

where  the  elements  of  the  Alt  are  real  numbers, 


k  —  TYIQjX ( 8q\ ,  8q2 j  ■  •  • ,  8cni^  (3.14) 

and 

Si  =  diag{ss'',sSi2,...,sSim}  (3.15) 

with 

8ij  =  max{8Cj  -  Jc  +  i,  0}.  (3.16) 

It  should  be  noted  that  Sk  =  S.  Attention  is  restricted  here  to  perturbations  of  the 
form 


8Ai 

8BX 


=  ( 8V)S 


where 


8A\o  8AU  •  •  •  8Aitk-i 

8Bw  8B\\  ■  ■  ■  8Bi,k-i 


and 


(3.17) 


(3.18) 


s  =  (3.19) 

The  implication  of  this  modelling  is  treated  first.  Practical  justification  then  follows. 
Substituting  (3.17)  into  (3.3)  and  making  use  of 

det(7  +  G1G2 )  =  det(I  +  G2G1)  (3.20) 
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gives 


<f>s  =  det[I  +  SM(6V)} 


where 


M  =  Ar1 


S  Rv 


Now,  because  of  Assumptions  7  and  8  and  Lemma  1, 


(3.21) 


(3.22) 


RwFtP  <  0(s-(v °+**+'*+1>)  <  0(s-J). 


(3.23) 


Hence,  S  is  proper  for  all  admissible  R 


W 


Also,  since  5  =  Sk, 


lim  SAX 1 

S — *00 


L  st- 1 J 


(3.24) 


So  SM  in  (3.21)  is  strictly  proper  for  all  admissible  Rw.  Moreover,  the  matrix  M  is 
analytic  in  Re  s>0  since 


A^l[S  |  Rw]  =  [(. A !  -  KrB)  \  (Y1  +  K,A)\. 


(3.25) 


Hence,  SM{8V )  is  analytic  in  Re  s>0  and  strictly  proper. 

It  immediately  follows  from  the  generalized  Nyquist  theorem  (e.g.,  see  Chapter 
2  of  [46])  that  <f)$  is  free  of  zeros  in  Re  s>0  when  there  are  no  encirclements  in  the 
complex  s-plane  of  the  point  —1  +  j’O  by  the  characteristic  loci  for  S(juj)M(juj)(8V). 
Now  M  is  a  function  of  Rw  which  in  turn  is  a  function  of  Zwz.  So  one  should  seek  an 
allowed  Zwz  satisfying  (2.24)  for  which  there  are  no  encirclements  of  the  —  1  +j0  point 
by  the  characteristic  loci  for  the  largest  set  of  expected  perturbations  6V  (i.e.,  so  that 
the  stability  margin  is  maximized).  This  design  problem  defies  an  analytical  solution, 
however.  Hence,  an  alternative  design  approach  is  considered  here.  Specifically,  that 
allowed  Zwz  satisfying  (2.24)  is  sought  for  which 

1  ri° ° 

J  =  —7  /  Tr(5MSM*5*)ds 

Z7 TJ  J—joo 

is  minimized  where  with  S  denoting  the  expectation  operator 

E  =  £{(8V){6V)'}.  (3.27) 


(3.26) 
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Clearly,  in  this  formulation  the  perturbation  matrix  is  assumed  to  contain  elements 
which  are  zero-mean  random  variables  with  known  covariances.  When  this  is  not  the 
case,  one  might  take  E  =  /  or  one  might  replace  E  with  some  appropriate  weighting 
matrix.  The  advantage  of  including  E  in  J  is  that  when  E  is  known,  the  structure  of 
the  perturbations  is  in  some  way  included  in  the  design  process.  J  being  finite  for  all 
allowed  Zwz  is  a  consequence  of  the  fact  that  SM  is  analytic  in  Re  s>0  and  strictly 
proper  for  all  admissible  Rw. 

It  is  expected  that  this  alternative  approach  may  yield  designs  with  stability  mar¬ 
gins  that  are  significantly  better  than  those  for  the  optimal  performance  design  which 
corresponds  to  the  choice  Zwz  =  0.  This  expectation  is  based  on  the  anticipation 
that  designs  for  which  J  is  kept  small  are  designs  which  in  practice  tend  to  keep 
S(juj)M(ju)8V  small  so  that  encirclement  of  the  —1  +  jO  point  are  avoided. 

The  functional  J  is  a  special  case  of  the  functional 

1  rjoo 

J  =  —  /  Tr{W1MW2M*Wu)ds  (3.28) 

2l X]  J — joo 

where  W\  and  W2  satisfy  the  following  assumption. 

Assumption  10:  The  km  x  m  matrix  W\  is  analytic  in  0  <Re  s<  00  and  such  that 
WiAj'1  is  strictly  proper.  The  (n+m)x(n+m)  matrix  W2  is  a  proper  para-Hermitian 
matrix  which  is  analytic  and  nonnegative  definite  on  the  s  =  ju>  axis. 

In  order  that  the  theory  developed  here  be  applicable  to  situations  in  which  the 
modelling  of  the  perturbations  8A\,  8BX  is  done  differently  and/or  the  weighting 
matrix  E  in  (3.26)  is  chosen  differently,  all  results  in  the  sequel  are  developed  for  the 
more  general  case  of  (3.28).  Specifically,  a  formula  is  derived  which  gives  the  strictly 
proper  Zwz  analytic  in  Re  s>0  and  satisfying  (2.24)  for  which  (3.28)  is  minimized. 

The  practicality  of  the  perturbation  model  assumed  in  (3.17)  thru  (3.19)  is  justified 
next.  In  most  applications,  FtP  is  strictly  proper.  When  A\  is  column  reduced,  it 
then  follows  that 

lim  FtP  =  lim  B\Sll A7}  =  0.  (3.29) 

S— ►  OO  5— >00 

Now  (3.29)  holds  if,  and  only  if, 

}}rnB1S;1  =  0  (3.30) 

or  the  column  degrees  of  B\  are  less  than  the  corresponding  column  degrees  of  A\. 
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(3.31) 


In  this  case,  it  is  always  possible  to  write 
Bx  =  ^iloBuSi 

and 

SB,  =  E toSBuSi 

which  agrees  with  the  second  block  row  of  (3.17). 

The  justification  for  taking  8  Aik  =  0  is  more  involved.  When  6 Aik  ^  0,  then 


(3.32) 


8V  = 


SVa  SAn 
SVB  0 


(3.33) 


where 


'  SVA  ' 

SAio 

8  An 

■  •  •  8Ai'k-i 

SVB 

SBi  o 

SBn 

•  •  •  8Bitk-i 

(3.34) 


and 


'  8Ai  ' 

SBi 

SVa  8 A\k 

8Vb  0 


S 

Sk 


(3.35) 


Using  (3.35)  in  (3.3)  and  invoking  (3.20)  now  yields 
<j>s  =  det(I  +  M) 

where 


(3.36) 


M  = 


5 

Sk 


Ai1  S  Ru 


SVA  8Aik 
8Vb  0 


(3.37) 


Since  under  Assumptions  7  and  8,  Rw  and  FtPRw  are  strictly  proper  for  all  ad¬ 
missible  Rw , 


lim 

S — ►OO 


S  Ru 


I  0 


(3.38) 


It  should  now  be  clear  when  (3.24)  is  recalled  that 

0 


lim  M  = 

3— ►  OO 


A-1 


i  o 


SVA  SAik 
SVb  0 


(3.39) 
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or 


M.{  oo) 


0  0 
A^6Va  A^SAik 


(3.40) 


So  with  SAik  -j-  0,  AA.  is  not  strictly  proper  and  any  H2  norm  associated  with  this 
matrix  is  infinite. 

Obviously,  8A\k  =  0  is  needed  for  the  theory  developed  here.  On  the  other  hand, 
since  8A\k  impacts  primarily  on  the  high  frequency  behavior  of  M.  and  since  from 
(3.40)  the  eigenvalues  of  A4(oo)  are  all  less  than  unity  in  magnitude  when 

MSMull  <  Prfllll^H  <  1,  (3.41) 


it  is  clear  that  the  impact  of  non-null  8A\k  on  encirclements  of  the  —1  +  j 0  point 
by  the  characteristic  loci  of  M  is  small  when  the  inequality  in  (3.41)  is  valid.  So 
the  designs  proposed  here  are  more  robust  to  parameter  variations  that  affect  the 
low  frequency  behavior  of  the  plant.  But  this  is  usually  desired,  since  loop  gains 
are  typically  small  in  the  higher  frequency  range.  The  problem  formulation  is  now 
complete. 


4.  The  Analytical  Solution 


The  constrained  optimization  problem  associated  with  (2.24)  and  (3.28)  is  solved 
by  introducing  a  Lagrangian  [16,17]  multiplier  a 2  and  minimizing  first 

</  =  «/  +  G:2A  Ewz.  (4.1) 

This  yields  the  minimizing  Zwz  as  a  function  of  a2.  The  parameter  a 2  is  then  chosen 
so  that  (2.24)  is  satisfied.  It  is  shown  in  the  Appendix  A  that  J  and  A Ewz  are  a 
monotonically  increasing  and  a  monotonically  decreasing  function  of  a2,  respectively. 
Also,  it  should  be  clear  that  minimizing  J  is  equivalent  to  minimizing 

X\J  =  X\J  +  A2A  Ewz  (4.2) 


when 
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(4.3) 


A,  = 


a 


1  +  a2 


1  +  a5 


=  1- Ax. 


So  the  set  of  all  Zwz  that  minimize  J  for  0  <  a2  <  oo  is  the  same  set  that  minimizes 
A iJ  with  Ai  >  0,  A2  >  0  and  Ai  +  A2  =  1.  Hence,  as  described  in  [31,32],  the  set  of 
solutions  Zwz  is  the  set  of  Pareto  optimal  solutions  for  the  multiple  objective  criteria 
J  and  A Ewz.  In  fact,  after  some  aditional  observations  described  below,  it  is  possible 
to  easily  derive  a  formula  for  this  set  of  Zwz  by  using  equation  (4.5)  in  [31]  and  setting 
a2  =  ^  in  the  result.  This  is  done  in  Appendix  B. 

Because  of  the  properties  of  W2  included  in  Assumption  10,  one  can  always  write 


W2  =  H2H2 * 


(4.4) 


where  H2  is  proper  and  analytic  in  Re(s )  >  0.  It  then  follows  from  (3.28)  that 

a  1  /*joo 

J=\\Ti\\l  =  iz-  Tr(TiTu)ds  (4.5) 

Z7TJ  J—j oo 

where 


Ti  =  WxMH2. 


(4.6) 


It  is  also  obvious  from  (2.24)  that 


a  ewz  =  \\n\\i 


(4.7) 


where 


?2  —  Zwz 


(4.8) 


Moreover,  the  matrix  M  in  (4.6)  can  be  expressed  in  terms  of  Zwz  with  the  aid  of 
(2.20)  and  (3.22).  From  (2.20), 


Rw  =  Rw  +  AiA  1  ZwzQ.  1 


A 

0 


(4.9) 


Then,  when  (2.4)  is  recalled, 


S  =  I-  RwFtP  =  I  -  RwFtP  -  AlA~1Zwzn-1 


B 

0 


(4.10) 
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Thus,  (3.22)  gives 

M  =  A^1  [  S  Rw  ]  =  1  + 

(4.11) 

where 

*i  =  A?[{I-RwFtP)\Rw), 

(4.12) 

i-H 

1 

< 

II 

<M 

(4.13) 

and 

T  ^-1  \-B  A' 

*»  =  n  [  0  oj- 

(4.14) 

Hence  (4.6)  is  equivalent  to 

T\  =  R\  —  U 1  Zwz  V\ 

(4.15) 

where 

Ri  =  Wi*iH2, 

(4.16) 

Ui  =  -W&  2, 

(4.17) 

and 

Vi  =  V3H2. 

(4.18) 

Similarly,  (4.8)  is  of  the  form 

T2  =  R2  —  U2ZwzV2 , 

(4.19) 

with 

R2  —  0,  U2  =  -I,  V2  =  I. 

(4.20) 

Clearly,  R2,  U2,  and  V2  are  proper,  real,  rational  matrices  which  are  analytic  in 
Re(s)  >  0.  It  is  also  established  in  the  sequel  that  the  same  is  true  of  Ri,  Ui,  and 
Vi.  In  addition,  both  R\  and  R2  are  strictly  proper.  As  a  consequence,  minimizing 
(4.2)  or 
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(4.21) 


A J  =  -  /wA.MII)  +  IlM  -  \fhUA, v2\\l 

with  respect  to  Zwz  is  a  special  case  of  the  problem  already  solved  in  Sections  III  and 
IV  of  [31].  Here  Zwz  plays  the  role  of  the  matrix  Q  in  [31]  and  the  fact  that  R\  and 
R2  are  strictly  proper  guarantees  that  Zwz  will  also  be. 

That  Ri,  U\,  and  V\  have  the  properties  described  above  is  established  with  the 
aid  of  the  following  lemmas  and  one  additional  assumption. 

Lemma  3:  Wi^2  <  O  ^_t/2+1^. 


Proof:  From  (4.13), 

2  =  Wt A"1  =  (Wi^K^A-1).  (4.22) 

From  Lemma  1,  AiA-1  <  0(s~1'2).  Since  W1A71  is  strictly  proper  by  Assump¬ 

tion  10,  it  immediately  follows  that  the  lemma  is  true. 


Lemma  4:  With  v 4  =  max{0,  —^0},  \L3  <  O  . 


Proof:  From  (2.4)  and  (4.14), 


^3  =  ^ 


-1 


-AFtP  A 

0  0 


Q-'A 


-FtP  I 

0  0 


Since 


i)  f }  1A  <  0(s  Vl)  because  of  Lemma  1, 

ii)  F*P<  0(s-*)  because  of  assumption  7, 

it  immediately  follows  that  the  lemma  is  true. 


(4.23) 


Lemma  5:  $1,  \L2  and  $3  are  analytic  in  Re(s)  >  0. 

Proof:  Since  M  is  analytic  in  Re  s  >  0  because  of  (3.25)  and  since  it  is  clear  from 
(4.11)  and  (4.12)  that  'Ll  is  the  particular  value  of  M  corresponding  to  the  choice 
Rw  =  Rw,  it  follows  that  'Ll  is  analytic  in  Re  s  >  0.  The  matrix  'L3  in  (4.14)  is  analytic 
in  Re  s  >  0  because  Assumption  6  guarantees  that  Q_1  is  good  and,  therefore,  analytic 
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in  Re  s  >  0.  Finally,  attention  is  turned  to  \H2  =  A-1.  From  Assumption  5,  A»A 
is  nonsingular  on  the  finite  s  =  ju  axis.  The  product  A, A  is  also  good  because 
AU{P*P  +  kQ)Ai  =  (PAi)*(PAi)  +  kAuQAi  good  follows  from  Q  good  and  PA\ 
good.  That  Q  is  good  is  a  consequence  of  Assumption  4.  The  proof  that  PA\  is  good 
is  given  in  Section  II  of  [41].  So  A* A  is  good  and  nonsingular  on  the  finite  s  =  ju> 
axis.  It  immediately  follows  that  the  same  is  true  for  A  and  \H2  =  A-1.  Since  A-1  is 
already  analytic  in  Re  s  >  0,  it  follows  that  A-1  is  analytic  in  Re  s  >  0. 

The  final  assumption  needed  is 
Assumption  11:  v0  =  v\  =  v2  =  0. 

Remark:  Assumption  11  is  consistent  with  Assumptions  7  and  8  and  the  specifi¬ 
cations  in  practical  cases.  Typically,  FtP  is  proper  which  is  equivalent  to  v0  —  0. 
Also,  P  is  strictly  proper  and  kQ  is  proper  and  positive  definite  at  infinity.  Hence, 
(P»P  +  kQ)~l  is  proper  which  is  equivalent  to  v2  =  0.  Moreover,  the  matrix  $  in 
(2.16)  is  typically  proper  and  positive  definite  at  infinity  so  that  4>-1  is  proper.  This  is 
equivalent  to  V\  =  0.  Finally,  it  should  be  noted  that  the  results  derived  in  the  sequel 
are  derived  with  a  somewhat  less  restrictive  Assumption  11  in  [47]  using  standard 
Wiener- Hopf  arguments.  Since  Assumption  11  is  satisfied  in  most  practical  applica¬ 
tions,  however,  the  simpler  derivation  based  on  the  already  existing  results  in  [31]  is 
given  here. 

All  is  now  in  place  to  readily  confirm  the  properties  of  Ri,  U\  and  Vi  cited  above. 
By  construction  H2  is  proper  and  analytic  in  Re  s  >  0.  Hence,  Ri  in  (4.16)  is  analytic 
in  Re  s  >  0  and  strictly  proper  when  the  same  is  true  of  W\^\.  Now  Wi  is  analytic 
in  Re  s  >  0  by  assumption  10  and  is  analytic  in  Re  s  >  0  by  Lemma  5.  Hence, 
Ri  is  analytic  in  Re  s  >  0.  Also, 


W1V1  =  HW[(/  -  RwFtP)\Rw).  (4.24) 

Now  W1A71  is  strictly  proper  by  Assumption  10.  Because  of  Assumptions  7  and 
11  and  Lemma  1,  it  is  also  true  that  Rw  and  RwFtP  are  strictly  proper.  Hence,  it 
immediately  follows  from  (4.24)  that  and,  therefore,  R\  is  strictly  proper. 

It  is  obvious  that  U\  in  (4.17)  is  proper  because  of  Assumption  11  and  Lemma  3. 
It  is  also  clear  that  U\  is  analytic  in  Re  s  >  0  since  W\  is  by  Assumption  10  and  ^2 
is  by  Lemma  5.  Finally,  Vi  is  proper  and  analytic  in  Re  s  >  0  when  'V3  is  because 
of  the  properties  of  H2.  Clearly,  ^3  is  analytic  in  Re  s  >  0  by  Lemma  5.  Also,  ^3  is 
proper  since  with  Assumption  11  one  has  U4  =  zq  =  0  in  Lemma  4. 

It  is  now  a  straightforward  process  to  prove  the  following  theorem  which  represents 
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the  key  result  of  this  section. 


Theorem:  When  a  /  0  and  Assumptions  1  thru  11  are  satisfied,  the  functional 

a  1  ri°° 

J  =  —  /  [Tr{W1MW2M*Wu)  +  a2Tr{ZwzZwz*)]ds  (4.25) 

Z7 TJ  J—joo 

with 

M  =  'V1  +  V2Zwzy3  =  A;1[(I-RwFtP)\Rw}  +  A-1Zwzn-1  ^  (4.26) 

is  minimized  over  the  set  of  all  strictly  proper  Zwz  analytic  in  Re  s>0  if,  and  only  if, 
Zwz  is  chosen  so  that 

vtcZwz  =  z0  =  V_1{V*M+  (4-27) 

where  y  is  a  Wiener-Hopf  spectral  solution  to  the  equation 

V  =  (tfaWs,)'  ®  (tfa.Wi.WW  +  a2 1  =  y*y,  (4.28) 

and  where 

v  =  vecV  =  -vec(«2*Wi*Wi^iW^«3*).  (4-29) 

Proof:  See  Appendix  B. 
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5.  The  2DOF  System 


It  should  be  clear  from  Fig.l  that  when  there  is  no  feedforward  transducer  (i.e., 
Lt= 0)  one  should  expect  that  the  results  obtained  for  the  3DOF  system  reduced  to 
a  2DOF  system  design.  That  is,  one  should  expect  that  Cz=  0  is  obtained.  It  is 
demonstrated  in  this  section  that  this  is  indeed  the  case.  In  the  process,  the  formulas 
for  the  2DOF  system  design  are  derived. 


It  follows  from  (2.3)  that  Cz—  0  is  equivalent  to  Rz  =  0.  It  has  already  been 
pointed  out  in  [1]  that  Rz  =  0  when  Lt  =  0.  Hence,  in  light  of  (2.20),  it  only  remains 
to  show  that 


6Rz  =  Ai\-1[Zv,\Za]CT1 


0 

I 


=  0. 


(5.1) 


The  first  step  entails  the  derivation  of  a  useful  formula  for  H. 
easily  finds  from  (2.15),  (2.16),  and  (2.19)  that 


With  Lt= 0,  one 


flfl* 


A(FtGdsFt*  +  Gm)A*  0 
0  Gi 


Hence, 


n  = 


fix  0 

0  n2 


(5.2) 


(5.3) 


where  Hi  and  fl2  are  Wiener-Hopf  spectral  solutions  to 


—  A(FtGdsFt*  4-  Gm)A * 


(5.4) 


and 


£12^2*  =  G\. 


(5.5) 


Substituting  (5.3)  into  (5.1)  then  yields 

8RZ  =  Axk-lZz^1.  (5.6) 

The  next  step  entails  the  derivation  of  useful  formulas  for  Zw  and  Zz  starting  with 
(4.27).  From  (4.14)  and  (5.3), 
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^3  =  [  ^  0' 

where 

£3  =  or1  [-B  A  . 

It  is  now  convenient  to  introduce 

A  = 

and 


B  = 

Then  (5.7)  yields 

B= [ ®  0 1 
0° 

where 

B  =  ty3W2  #3*, 

and  it  follows  from  (4.28)  that 

T>_  [  (B'®A  +  a2I)  0 

[  0  a2/ 

and 


V  = 


V  0 
0  al 


(5.7) 

(5.8) 

(5.9) 

(5.10) 

(5.11) 

(5.12) 


where  V  is  a  Wiener-Hopf  spectral  solution  to 

V»V  =  B'  ®  A  +  a2 1  =  ($3^k)'  ®  +  a2 1.  (5.13) 

The  final  piece  of  information  needed  for  (4.27)  is  v  which  follows  from  (4.29). 
Clearly,  (5.7)  leads  to 
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V  0 


(5.14) 


V  =  -^2*WUW1^1W2^3*  = 


where 


V  =  -<a2*W1,W^1W^.  (5.15) 

Using  careful  bookkeeping  and  recalling  that  FtP  =  A_1B  =  B\  A^1  is  an  nxm 
matrix,  one  can  show  that  Zw  and  V  are  mxn  matrices  and  V  is  an  nmxnm  matrix. 
Hence, 


v  =  vecV  = 


I 

A  / 

0 


where  the  vector 


v  = 


V\  V2 


vecV 


has  nm  rows.  Substituting  (5.12)  and  (5.16)  into  (4.27)  then  gives 
Vec[Zw  \ZzJ  — 


V  1{v/«}+ 


0 


from  which  it  follows  that 


vecZw  =  v  ^V*1^ 


and 


vecZz  =  0. 


(5.16) 

(5.17) 

(5.18) 

(5.19) 

(5.20) 


Hence,  (5.6)  gives  SRZ— 0  and  (5.1)  is  confirmed. 

The  Zw  given  by  (5.19)  is  denoted  by  Zwq.  The  associated  expression  for  Rw  is 
obtained  from  (2.20),  with  Zz=  0  and  fl  given  by  (5.3).  One  gets 


Rw  —  Rw o  —  Rw  +  ^4iA  Zw qHj  A 


(5.21) 


where 


-  -l  -  -l 

vecZw o  =  V  { V*  W+- 


(5.22) 
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Using  (5.21)  in  (2.13)  then  gives  the  Cw  in  the  2D0F  controller 

r(s)  =  (7u(s)uo(5)  -  Cw(s)w(s)  (5.23) 

that  results. 

Since  the  functional  (4.25)  reduces  to 

a  1  ri°° 

J  =  —  /  [Tr(W1MW2M*Wu)  +  ct2Tr(ZwZw)\ds  (5.24) 

Z7TJ  J—joo 

with 


M  =  $1  +  ^2ZW4>3  (5.25) 

when  Zz= 0,  it  follows  that  (5.24)  is  minimized  when  Rw—Rw o  and  Zyj - ZWQ  .  This  is 

indeed  the  result  one  would  seek  in  the  robust  design  of  a  2DOF  controller. 

It  should  be  noted  that  the  above  results  are  independent  of  Gi  so  long  as  As¬ 
sumption  6  and  $-1  <  0(s~2ui)  in  Assumption  7  are  satisfied.  Now  (5.2)  is  the 
expression  for  Clearly,  all  that  is  needed  for  the  satisfaction  of  Assumption  6 

is  that  where 

$  =  FtGdsFt*  +  Gm,  (5.26) 


be  analytic  and  nonsingular  on  the  finite  part  of  the  s  =  ju>  axis.  For  then  one  can 
always  pick  a  Gi  so  that  the  same  is  true  for  A4>A*.  With  regard  to 


4)-1  o 
0  Gjl 


<  0(s~2l/1) 


(5.27) 


all  that  is  needed  is  $  1  <  0(s  21/1 )  for  one  can  always  pick  a  G\  satisfying  Gi 1  < 

0(8 ~2l/1). 

As  a  consequence  of  the  above  results  and  observations  it  is  now  possible  to  state 
without  any  further  proof  the  following  corollary  to  the  main  theorem. 


Corollary:  When  a  ^  0  and  Assumptions  1  thru  11  are  satisfied  with  A$A*  replacing 
A$A*  in  Assumption  6  and  with  4>  replacing  $  in  Assumption  7,  the  functional 

a  1  n  oo 

J  =  —  /  [Tr(W1MW2M*Wu)  +  a2Tr{ZwZw*)]ds  (5.28) 

Z'K J  J—joo 
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with 


M  =  +  q2ZwV  3  (5.29) 

is  minimized  over  the  set  of  all  strictly  proper  Zw  analytic  in  Re  s>  0  if,  and  only  if, 
Zw  is  chosen  so  that 

vecZw  =  V_1{V:^}+  (5.30) 

where  y  is  a  Wiener-Hopf  spectral  solution  to  the  equation 


V, V  =  ®  (*2.Wi.W,®2)  +  a2/ 


(5.31) 


and  where 


v  =  vecV  =  — 


(5.32) 


6.  Computational  Issues 


Once  Rw  and  Rz  have  been  determined,  the  final  step  in  the  design  process  can 
be  taken.  Specifically,  the  controller  transfer  matrices  Cw  and  Cz  can  be  computed. 
From  (2.2)  and  (2.3),  one  gets 

Cw  =  (I  —  RwFfP)-1  Rw  (6.1) 

and 

Cz  =  {I-RwFtP)~lRz.  (6.2) 

These  formulas,  however,  involve  a  number  of  cancellations  that  must  be  assured 
despite  roundoff  error  in  the  numerical  computation.  Failure  to  enforce  these  theo¬ 
retically  required  cancellations  can  lead  to  a  synthesized  controller  with  poles  (zeros) 
where  the  plant  has  zeros  (poles)  in  Re  s>0.  In  such  cases,  the  system  is  unstable 
because  of  unstable  hidden  modes. 

The  best  way  to  proceed  is  to  recognize  from  (2.20)  that  Rw  and  Rz  are  of  the 
form 
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Ryj  -  A\Hyj 


(6.3) 


and 


Rz  =  A,Hz  (6.4) 

where  Hw  and  Hz  are  analytic  in  Re  s>  0  since  the  formulas  for  Rw  and  Rz  given  in  [1] 
are  also  of  this  form.  So  one  would  compute  Hw  and  Hz  separately  and  then  recognize 
that  the  substitution  of  (6.3)  and  (6.4)  into  (6.1)  and  (6.2)  yields,  respectively, 

Cw  =  D~lHw  (6.5) 

and 

C.=DZ'H,  (6.6) 

where  in  view  of  (2.4) 

Dw  =  A?  {I  -  RwFtP )  =  (/  -  HwBx)A-\  (6.7) 

It  is  important  to  recognize  that  additional  cancellations  occur  in  the  computation 
of  Dw.  Since  Rw  is  acceptable,  it  follows  from  (2.7)  and  (6.3)  that 


HW  =  Y1  +  KXA.  (6.8) 

Using  (2.5c)  and  (6.8)  in  (6.7),  one  gets 

Dw  =  [. I-(Y1+K1A)B1]A r1 
=  [I-Y1B1-K1AB1}A^1 

=  [XiAi  -  K1BA1]AX1  =  Xx-  K\B.  (6.9) 

Since  Xx  and  B  are  polynomial  matrices  and  K\  is  analytic  in  Re  s  >  0,  it  follows 
that  Dw  is  analytic  in  Re  s  >  0.  Thus,  when  using  the  final  result  in  (6.7)  to  calculate 
Dw ,  one  must  take  care  to  assure  that  no  poles  of  Axl  in  Re  s  >  0  are  poles  of  any 
element  of  Dw.  This  is  accomplished  for  each  element  of  Dw  by  dividing  the  factor 
containing  the  poles  of  Axl  in  Re  s  >  0  into  the  numerator  polynomial  and  ignoring 
any  remainder  polynomial  as  being  due  to  roundoff  error. 
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7.  A  Numerical  Example 


A  2D0F  design  is  carried  out  in  this  section  to  illustrate  the  advocated  design 
methodology  for  trade-off  between  performance  and  stability  margin.  The  following 
single-input-single-output  plant  is  considered  for  a  pure  servo  problem 


P(s)  = 


s  —  1 
s(s  —  2)  ’ 


(7.1) 


The  spectral  densities  of  the  signals  are  Gu  =  —  and  Gd  =  Gm  =  Gn  —  0,  i.e.,  no 
disturbance,  input  noise,  or  measurement  noise.  The  first-order  plant  perturbations 
are  accounted  for  by  the  plant  uncertainty  spectral  density 


G,(a) 


<r2(l-s2) 
s2(s2  -  4) 


(7.2) 


and  the  constants  are  taken  a,s  k  =  Q  =  Ft  =  To  =  [i  =  <?  =  1-  Therefore,  Gds  =  Gs. 
The  optimal  2D  OF  design  has  been  carried  out  in  [2]  and  Rw  is  given  by 


s(s  —  2)[(8  +  3-\/7)s  —  1] 
( s  +  l)(s2  +  Vis  +  1) 


(7.3) 


This  Rw  is  realized  with 

/V  _  (8  +  3\/7)s  —  1 

.-(5  +  2^)' 

For  this  design, 

Ew  =  68.395 


(7.4) 


(7.5) 


which  is  the  minimum  value  of  the  cost  functional 

1  rjoo  1  rjoo 

Ew  =  - — :  /  Tr(RwGsRw *)  ds  +  —  Tr[{I  -  PRW)GS(I  -  PRW)*\  ds.{ 7.6) 

Equation  (7.6)  follows  from  (2.12)  with  the  given  data  for  this  example.  It  should  be 
noted  that  this  Ew  reflects  only  first-order  plant  uncertainty  so  that  the  choice  of  a 
non-zero  Zw  for  improved  stability  margin  corresponds  to  an  increased  sensitivity  to 
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“small”  plant  uncertainty  as  measured  by  the  cost  increment  AEW.  Moreover,  with 
Zu  =  0  no  change  in  tracking  and  plant  saturation  cost  occurs. 

The  case  considered  is  one  in  which  the  normal  operating  profile  involves  only 
small  perturbations  in  the  numerator  polynomial  coefficients  as  modelled  by  (7.2). 
However,  an  infrequent  abnormal  operating  profile,  which  results  in  a  significant 
change  in  the  plant  poles  at  s=0  and  s=2  and  the  plant  zero  at  s=l,  is  expected.  In 
order  to  provide  more  stability  margin  for  the  abnormal  case,  a  trade-off  of  optimal 
performance  under  normal  operating  conditions  for  greater  stability  margin  in  the 
abnormal  case  is  considered  here  through  a  nonzero  choice  of  Zw.  It  is  assumed 
that  the  plant  zero  and  poles  change  in  a  fashion  such  that  only  8A\ o  and  8B\q  are 
nonzero.  Then,  with  8P  equal  to  the  first  block  column  of  (3.18),  it  follows  that 
(3.19)  reduces  to  S  —  So  =  1  in  (3.17).  Moreover,  with  8A\o  and  8B\o  independent 
zero-mean  random  variables  each  with  unity  variance,  the  matrix  S  in  (3.27)  is  an 
identity  matrix.  So  in  (4.25),  W\  =  <5  =  1  and  W2  =  £  =  /.  It  now  follows  in  a 
straightforward  fashion  from  (5.31)  since 


Ms*  = 


s4  -  5s2  +  1 
<r2(l  -  s2) 


and 


$2,  $2 


1 

s4  —  5  s2  +  1  ’ 


that 


acrs  +  VTT  a2a2 
cr(s  +  1) 


Furthermore, 


(7.7) 


(7.8) 


(7.9) 


.  =  (8  +  3 y/?>  -  (5  +  2y/7) 
a(s2  —  1)(^2  —  s  A  1) 

It  then  follows  from  (5.30)  that 


acrs  +  a/1  +  a2cr2 


=  -vec^.Ms,)- 


(7.10) 


(7.11) 


where 
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(7.12) 


_  13  +  5\/7 

(2  +  y/7  )(aa  +  \/l  +  a2  a2) 

The  trade-off  in  performance  and  stability  margin  obtained  through  different 
choices  of  a  is  summarized  in  the  following  table  where  the  values  listed  for  the 
square  root  of 

„  1  ri°° 

\\M\\\  =  —  /  Tr(MM.)  ds  =  J  (7.13) 

Zttj  J —joo 

and  for 

||Af  ||oo  =  sup  a(M(ju>)),  (7.14) 


where  <r(.)  denotes  the  maximum  singular  value  of  (.),  are  the  ones  obtained  with  the 
listed  Cw  that  was  obtained  using  the  design  methodology  described  in  this  paper. 
The  last  two  columns  are  explained  later.  Clearly,  by  giving  up  performance,  a 
reasonable  gain  in  the  stability  margin  may  be  achieved.  The  first  row  in  the  table 
corresponds  to  the  optimal  design  (7.4).  As  expected  ,  the  smaller  a  is,  the  better 
is  the  stability  margin  and  the  greater  is  the  degradation  in  performance.  The  a 
corresponding  to  a  100%  increase  in  cost,  namely  a*,  is  0.165393.  In  this  case,  5.24 
db  improvement  in  stability  margin  is  achieved  as  measured  by  HMHoq. 


As  noted  earlier,  a  quadratic  measure  was  chosen  as  an  approximate  measure  of 
stability  margin  in  this  paper.  To  show  the  efficacy  of  this  design  methodology,  a 
numerical  optimization  was  carried  out  to  evaluate  the  best  Hoo  design  for  each  row 
in  Table  1.  This  optimization  was  carried  out  over  the  set  of  first  order  Zw,  namely 
Zlj,  that  give  the  listed  value  of  A Ew  since  the  optimal  solution  for  Zw  in  the  H2 
setting  was  also  first  order.  Specifically,  the  optimization  was  carried  out  over  Zw  of 
the  form: 

Zw  =  a 2  >  0,  (7.15) 

s  +  a  2 

and  a\  =  ±\/2a2AEw  in  order  to  assure  that  1 1 2^^  1 1 1  =  A Ew. 

The  last  column  in  Table  1  lists  the  stability  margin  for  the  best  solution  over 
the  set  of  allowable  first  order  Zw  when  a\  is  negative.  The  column  before  the  last  is 
the  same  as  the  last  column  except  that  the  parameter  a\  is  now  positive.  Obviously, 
the  attained  stability  margins  for  the  H2  solutions  are  reasonably  close  to  the  H <*, 
solutions.  In  all  cases  shown  for  a  positive  oq,  the  H2  solutions  yielded  an  improvement 
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a 

A  Ew 

(%of  Ew) 

IWU 

W 

||M||2 

(4b) 

mfzl  HMIU 
(db)  -  oq  >  0 

00 

0 

(0%) 

20.29 

14.187 

z=0.063,  p=10.292 
g=15.937 

20.29 

20.29 

1.934 

(2.83%) 

18.786 

13.337 

z=0.077,  p= 12.216 
g=18.276 

17.74 

16.66 

71.006 

2.611 

(3.82%) 

18.570 

13.213 

z=0.080,  p=12.590 
g=18.731 

17.44 

16.66 

13.064 

z=0.083,  p=13.083 
g=19.329 

5.056 

(7.39%) 

18.003 

12.885 

z=0.087,  p=13.747 
g=20.137 

16.63 

16.66 

7.302 

(10.67%) 

17.636 

12.670 

z=0.091,  p=14.675 
g=21.265 

16.12 

16.66 

0.5 

79.285 

10.890 

(15.92%) 

17.194 

12.410 

z=0.098,  p=16.034 
g=22.916 

15.49 

16.66 

0.4 

85.352 

16.957 

(24.79%) 

16.671 

12.098 

z=0.106,  p=18.155 
g=25.493 

14.74 

16.66 

28.168 

(41.18%) 

16.052 

11.723 

z=0.116,  p=21.813 
g=29.939 

13.83 

16.66 

120.910 

52.515 

(76.78%) 

15.327 

11.277 

z=0.131,  p=29.335 
g=39.080 

12.72 

16.66 

a* 

136.790 

— j 

11.103 

z=0.137,  p=34.117 
g=44.891 

12.28 

16.66 

0.1 

198.237 

129.879 

(189.9%) 

14.490 

10.747 

z=0.150,  p=52.335 
g=67.031 

11.36 

16.66 

Table  7.1:  Suboptimal  designs  for  improved  stability  margin  for  various  as. 


in  stability  margin  which  was  at  least  59%  of  that  obtained  with  the  associated 
solution.  Moreover,  when  a  better  solution  than  the  H2  solution  is  desired,  then  the 
H2  solution  provides  a  good  initial  point  for  an  iterative  search  for  the  solution. 
A  larger  gain  in  stability  margin  may  be  achieved  when  a  >  0.8  if  negative  values 
of  ai  are  permitted.  However,  in  this  case,  the  controller  needed  is  second  order  and 
the  larger  stability  margin  is  realized  at  the  expense  of  an  unreasonably  fast  pole  in 
the  controller.  Therefore,  the  resulting  controller  is  not  a  practical  one.  Actually, 
this  is  another  reason  why  an  approximate  measure  of  stability  margin  may  be  more 
suitable  since  a  controller  with  a  very  high  bandwidth  may  be  needed  if  one  insists 
on  achieving  the  maximum  possible  stability  margin.  In  one  example,  by  giving  up  a 
small  amount  of  stability  margin,  one  could  reduce  the  controller  bandwidth  required 
to  a  more  practical  value. 
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Additional  insight  regarding  the  solutions  obtained  can  be  gained  from  an  exam¬ 
ination  of  the  stability  region  in  the  Mio,  <5 -Bio  parameter  space.  The  closed-loop 
characteristic  polynomial  for  this  example  with  first  order  controllers  of  the  form 

Cw(s)  =  g —  (7.16) 

s  —  p 


is  given  by 

Ac(s)  =  (s2  —  2s  +  6Aio)(s  —  p)  +  g(s  —  z)(s  —  1  +  SBio).  (7.17) 

Appealing  to  the  Routh  test  for  the  above  polynomial  gives  as  the  necessary  and 
sufficient  conditions  for  asymptotic  stability: 


g-p-2  > 

G 9  ~  2)  <5 Aio  +  g(g  —  p  +  z  —  2)  SB\o  + 
-  6Aw-gz  6Bw  +  gz  > 


0  (7.18a) 

{g-p-  2)[2p  -  g{\  +  z)\  -  gz  >  0(7.18b) 
0.  (7.18c) 


The  inequality  (7.18a)  is  satisfied  automatically  since  all  the  controllers  stabilize 
the  nominal  plant.  The  last  two  inequalities  establish  the  range  of  allowable  per¬ 
turbations  in  the  plant  parameters  and  are  linear  in  the  unknown  parameters  SAW 
and  6 Bio-  The  set  of  parameters  for  which  the  closed- loop  system  is  asymptotically 
stable  is  shown  in  Figure  7.1  (the  hatched  area).  This  set  corresponds  to  the  optimal 
controller  (i.e.,  a  =  oo).  When  perturbations  along  any  ray  drawn  from  the  origin  are 
equally  likely,  the  shortest  distance  from  the  origin  to  the  boundary  of  the  stability 
region  defines  the  stability  margin.  This  distance  is  the  radius  of  the  circle  shown  in 
Figure  7.1.  This  radius  is  in  agreement  with  the  value  of  UMH^1  obtained  from  Table 
1  for  the  a  =  oo  case  and  is  the  minimum  distance  from  the  origin  to  the  line  h- 
The  constraint  (7.18c)  is  indeed  the  limiting  factor  in  this  example.  As  parameter  a 
decreases,  the  line  I2  rotates  counter-clockwise  about  the  point  (0,1);  therefore,  the 
stability  margin  increases.  The  parameter  set  for  a*  =  0.165393  is  given  in  Figure 
7.2.  In  this  case,  the  stability  margin  is  larger  by  a  factor  of  about  2.  The  circle  with 
the  smallest  radius  corresponds  to  the  optimal  design.  The  second  circle  corresponds 
to  the  suboptimal  design  and  the  largest  circle  corresponds  to  the  best  HrXj  design 
constrained  to  the  set  of  first  order  Zw.  It  should  be  pointed  out  that  it  is  inherently 
difficult  to  achieve  a  good  stability  margin  for  this  plant  [2]  since  it  is  unstable  and 
non-minimum  phase  and  the  relative  distance  between  the  right-half-plane  zero  and 
pole  is  small.  Never-the-less,  a  substantial  relative  improvement  is  possible  when  a 
trade-off  of  performance  can  be  considered. 
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Figure  7.1:  Allowable  parameter  space  for  the  optimal  H2  design. 


Figure  7.2:  Allowable  parameter  space  for  the  suboptimal  H2  design. 

8.  Conclusion 

A  promising  control  design  methodology  for  trade-off  between  performance  and 
stability  margin  for  multivariable  systems  has  been  introduced  and  applied  to  a  scalar 
example.  To  attain  an  analytical  solution  to  this  fundamental  trade-off  problem,  a 
quadratic  measure  for  stability  margin  is  chosen.  Furthermore,  a  novel  approach  is 
introduced  which  allows  structured  perturbations  in  the  coprime  polynomial  matrix 
fraction  description  of  the  plant  transfer  matrix  to  be  taken  into  account.  The  ad¬ 
vocated  methodology  provides  a  good  initial  solution  from  which  better  solutions,  if 
needed,  may  be  searched  for  iteratively.  Future  work  will  include  applications  of  this 
design  methodology  to  practical  multivariable  examples  to  assess  the  efficacy  of  the 
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methodology  presented  here.  Needed  in  this  effort  are  efficient  numerical  algorithms. 
In  this  regard,  it  is  expected  that  [48-51]  will  play  an  important  role. 

Appendix  A 

The  cost  functional  (4.1)  is  of  the  form 


J  =  J  +  a2I<  (8.1) 

with 

K(Zwz)  =  A  Ewz.  (8.2) 

The  optimum  Zwz  is  the  one  which  minimizes  J  and  it  depends  parametrically  on  a. 
In  this  regard,  one  has  the  following  lemma. 

Lemma  Al:  J  is  a  monotonically  increasing  function  of  a2  and  A  is  a  monotonically 
decreasing  function  of  a2. 


Proof:  Suppose  with  a2  =  a2,  Zwz  —  ZWZl  is  the  unique  optimum  choice  and  for  this 
choice  J  —  J\  and  K  =  K\.  Similarly,  with  a2  =  ,  Zwz  =  ZWZ2  minimizes  J  and 

J  =  J2  and  K  —  K?  in  this  case.  Then, 


J\  +  0L2  K\  >  J2  olq2 K 2 

(8.3) 

and 

J2  T  Ci-y2 K-2  >  J\  T  OL-y2 K\. 

(8.4) 

Multiplying  (8.3)  with  o2  and  (8.4)  with  a22 

and  adding  the  two  expressions  yields 

«l2(Ji  -  J2)  >  a22(Jy  -  J2). 

(8.5) 

Therefore, 

(Ji-J2)(a!2-a22)>0. 

(8.6) 

This  implies  that 


[Ji  —  J2)  <  0  4^"  ol-]2  —  o.'i'  <  0 


[or  (Jx  -  J2)  >  0  a.2  -  a2  >  0).  (8.7) 


Therefore, 
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(or  Ji  >  J2  44>  <*i2  >  a22). 


Ji  <  J2  &  ai 2  <  a22 


(8.8) 


The  above  establishes  that  J  is  a  monotonically  increasing  function  of  a2.  From  (8.3), 


Jx-J2>  a2\K2  -  Kx). 


(8.9) 


Furthermore,  from  (8.7)  and  (8.9), 


c*i2  <  a22  44-  0  >  Jx  ~  J2  >  cl2\K2  —  ifi). 


(8.10) 


So, 


ax 2  <  a22  44  Ji  <  J2  and  Kx  >  K2. 

This  completes  the  proof. 


(8.11) 


Appendix  B 

The  proof  of  the  theorem  stated  in  Section  IV  is  given  in  this  appendix.  The  first 
step  is  to  recall  that  minimizing  (4.25)  with  respect  to  Zwz  is  equivalent  to  doing 
the  same  to  (4.21).  This  optimization  problem  is  a  special  case  of  the  one  solved  in 
[31].  Since  Rx  and  R2  are  strictly  proper  here  and  since  it  is  clear  from  (4.20)  with 
A2  ^  0  that  both  \f\2U2  and  V2  have  full  column  rank  everywhere  in  the  complex 
plane  (infinity  included),  it  follows  from  Theorem  3.4  in  [31]  that  one  can  always 
write  the  optimal  choice  for  Zwz,  which  plays  the  role  of  Q  in  [31],  as 

=  Zv,"  +  (8.12) 

where  the  constant  matrix  ZWZ(X>  —  0  and  ||^W2||2  exists. 

Now  all  that  remains  is  the  implementation  of  the  steps  given  in  Section  IV  of 
[31]  to  derive  the  formula  for 

z:2  =  ZW2.  (8.13) 

First  set 


vec(\A7i7i) 

yec(^/\^Wx^iH2) 

vec(\/A 2R2) 

0 

(8.14) 
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and 


'  v;  <g>  (VAit/i)  ‘ 

.  v'  0  {V\2U2) . 

-Vhi 

Next  recognize  that  W*  =  W  in  [31]  and  form 
WW  =  W,W  =  \XV 


(8.15) 


(8.16) 


where  T>  is  given  by  (4.28)  with 

a2  =  y  *  0.  (8.17) 

The  matrix  W*W  has  a  Wiener  spectral  factor  W0.  Hence,  so  does  V  and  the 
symbol  V  is  used  to  represent  it.  Clearly,  one  can  take 

W0  =  v^V.  (8.18) 

Associated  with  this  factorization  is  the  inner  matrix 

Wi  =  WW'1  =  -^=WV-\  (8.19) 

VAi 

Using  the  fact  that 

PhAW]t)  =  {Wt*r}+  (8.20) 

in  (4.5)  of  [31]  then  gives 

vec(Z:j=*„  =  W-'{H'i.r}+  (8.21) 


or 


V^W.r 


+ 


From  (8.14)  and  (8.15),  one  gets 

W„r  =  <g>  vec (/Wx^^). 


(8.22) 


(8.23) 
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The  identities  (see  [52]) 

( G\  ®  G2)*  —  Gu  <8>  G2* 


(8.24) 


and 


vec(G!iGr2G!3)  —  (Gg  ®  Gi)vec(G2)  (8.25) 

then  lead  to 

W*r  =  -vec  ((y/\1^2*W1,)(^Wl^lH2)(H2^3^  (8.26) 


or 


W*r  =  Am  (8.27) 

where  u  is  given  by  (4.29).  Substituting  (8.27)  into  (8.22)  gives  (4.27)  and  completes 
the  proof. 
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